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Abstract 

The ability to estimate the fraction of ground flashes in a set of flashes observed by a satellite lightning 
imager, such as the future GOES-R Geostationary Lightning Mapper (GLM), would likely improve 
operational and scientific applications (e.g., severe weather warnings, lightning nitrogen oxides studies, 
and global electric circuit analyses). A Bayesian inversion method, called the Ground Flash Fraction 
Retrieval Algorithm (GoFFRA), was recently developed for estimating the ground flash fraction. The 
method uses a constrained mixed exponential distribution model to describe a particular lightning optical 
measurement called the Maximum Group Area (MGA). To obtain the optimum model parameters (one of 
which is the desired ground flash fraction), a scalar function must be minimized. This minimization is 
difficult because of two problems: (1) Label Switching (LS), and (2) Parameter Identity Theft (PIT). The 
LS problem is well known in the literature on mixed exponential distributions, and the PIT problem was 
discovered in this study. Each problem occurs when one allows the numerical minimizer to freely roam 
through the parameter search space; this allows certain solution parameters to “interchange roles” which 
leads to fundamental ambiguities, and solution error. A major accomplishment of this study is that we 
have employed a state-of-the-art genetic-based global optimization algorithm called Differential 
Evolution (DE) that constrains the parameter search in such a way as to remove both the LS and PIT 
problems. To test the performance of the GoFFRA when DE is employed, we applied it to analyze 
simulated MGA datasets that we generated from known mixed exponential distributions. Moreover, we 
evaluated the GoFFRA/DE method by applying it to analyze actual MGAs derived from low-Earth 
orbiting lightning imaging sensor data; the actual MGA data were classified as either ground or cloud 
flash MGAs using National Lightning Detection Network™ (NLDN) data. Solution error plots are 
provided for both the simulations and actual data analyses. 
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1. GOAL OF R3 PROJECT 


When viewing lightning from space at optical wavelengths, the cloud multiple scattering medium obscures the view thereby preventing one 
from easily determining what flashes strike the ground. However, recent studies have made some progress in estimating the ground flash 
fraction, a > in a set of N flashes observed from space [Koshak (2010), Koshak and Solakiewicz (201 1), and Koshak (2011)]. Knowledge of 
a is important for better understanding: 

• Severe Weather, 

• Lightning Nitrogen Oxides Chemistry/Climate Studies, and 

• Global Electric Circuit. 

In the study by Koshak (2011), a Bayesian inversion method was introduced for retrieving the 
fraction of ground flashes in a set of flashes observed from a (low earth orbiting or geostationary) 
satellite lightning imager. This method has formed the basis of a Ground Flash Fraction Retrieval 
Algorithm (GoFFRA) that is being tested as part of GOES-R Geostationary Fightning Mapper 
(GEM) risk reduction. Figure 1 summarizes the basic question addressed in this study. 


2. THE BAYESIAN RETRIEVAL 
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Figure 2 highlights the mathematical attributes of the Bayesian retrieval process. The ^ ' s are the population mean Maximum Group Areas 
(MGAs) decremented by the nadir footprint area of the space-based lightning imager employed. In summary, the distribution of y (nadir 
footprint decremented MGA) is assumed to follow a mixed exponential distribution; one finds the 3 parameters of this distribution that 
maximizes a scalar function which is a product of probabilities. 


Bayesian Inversion 


Bayes' Law 


h |y)= 


Pty a r ti ( .p)IXa.fi r tL) 


Flrd parameters v ■ that maximize the probability on LHS 

This m#ans one maximizes the renewing 

Formally : 

a ^ v ' 0 => v ■'Maximum A Pusterion (MAP) Solution' 
dv 

Practically 

fit BrG¥idcn-K]e{ctic?-(jGld(ivb-$}iJitnen vnrunl ef linvidwi-l'tetlier-PweH numcricMl mtiticd Iw 
in inimijfic vl Alla, ii lunpiilicd by m ni min g .mode L pnrumacr independence, with Ptp) 
uniform, tnd ) It PljtJ belli normil diftributieni 


Grobner Initialization 

_ y - n c 
V g ~.“c 

-B-JB--4AC 

2A 

-B + fB'--4AC 


f<7‘ ♦*’> 

<■= ur +27**+//) 



Figure 2. Left slide summarizes the Bayesian retrieval approach. To obtain the optimum parameters, the optimization begins by using an initial guess of the 
solution, such as the “Grobner Initialization” (right slide). 


3. ACCOMPLISHMENTS 


4. PERFORMANCE TESTS 


a. Simulated Data To test the performance of the GoFFRA when DE is employed, we applied 
it to analyze simulated MGA datasets that we generated from known mixed exponential 
distributions. The solution retrieval errors are provided in Figure 3. The retrieval errors are 
shown as a function of the true ground flash fraction (alpha) of the mixture; vertical lines are 
standard deviations about the mean retrieval errors. The algorithm performs quite well. 


b. OTD Data Moreover, we evaluated the GoFFRA/DE method by applying it to analyze 
actual MGAs derived from the low-Earth orbiting lightning imager called the Optical Transient 
Detector (OTD). The actual MGA data were classified as either ground or cloud flash MGAs 
using National Fightning Detection Network™ (NFDN) data. The ground flash fraction 
retrieval error plots are provided in Figure 4. Errors increase for large alpha (~ 0.7) indicating 
a breakdown in the mixed exponential distribution model for the ground flash MGAs. This is 
an important finding, as it requires us to upgrade the form of the model to further reduce the 
retrieval errors. 
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Figure 3. Retrieval errors for simulated data. 


N = 5000 


c. LIS Data We have also made significant progress in creating test data 
sets based on LIS data, and evaluating algorithm performance. To create 
the test dataset, we inter-compared 9 years (2003-2011) of LIS and NLDN £ 

over CONUS and thereby categorized each LIS flash as either a ground or i5 

cloud flash. Figure 5 shows the location of the LIS ground (red) and LIS ° 

cloud (blue) flashes over the US. The distributions of the LIS ground flash % 

and cloud flash MGAs are the remaining figures in Figure 5. Application 
of the GoFFRA algorithm to mixtures of the FIS ground and cloud flashes 
produced even larger retrieval errors than for OTD, with errors again 
increasing for large values of alpha. Even though the ground flash fraction 
in nature is normally small (i.e., where our algorithm performs the best), we 
want to upgrade the mixed exponential distribution model to reduce retrieval 
error across the entire alpha domain. 

Figure 4. Retrieval errors using mixtures of OTD ground & cloud flashes. 
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Figure 5. The new test dataset created by comparing 9 yrs of LIS and NLDN data. Leftmost plot is the locations of LIS ground (red) & cloud (blue) flashes. 


In addition to developing the Bayesian formalism for the retrieval problem, introducing the use of the mixed exponential model, and deriving 
the Grobner Initialization, we have conducted preliminary tests of the GoFFRA on simulated datasets. This allowed us to uncover solution 
errors due to Fabel Switching (FS) and Parameter Identity Theft (PIT). The FS problem is well known in the literature on mixed exponential 
distributions, and the PIT problem was discovered in this study. Each problem occurs when one allows the numerical minimizer to freely roam 
through the parameter search space; this allows certain solution parameters to “interchange roles” which leads to fundamental ambiguities, and 
solution error. A major accomplishment of this study is that we have employed a state-of-the-art genetic-based global optimization algorithm 
called Differential Evolution (DE) that constrains the parameter search in such a way as to remove both the FS and PIT problems. We have 
also applied the GoFFRA/DE method to both simulated and actual MGA datasets (see following section) to evaluate algorithm performance. 


□ Accomplishments Anticipated @ end of Year 2: We will upgrade/replace the mixed exponential model, perform retests, and show reduced retrieval errors. 

□ Initial Feedback from Potential Users: Given our results above, we anticipate the start of getting feedback regarding specific GoFFRA applications. 

□ What Could be Accomplished w/3 rd Year Funding: Completion of additional error analyses showing robustness of algorithm; more elaborate demos. 
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